National Repository of Grey Literature 4 records found  Search took 0.01 seconds. 
Automatic Delivery Note Transcription
Necpál, Dávid ; Kišš, Martin (referee) ; Hradiš, Michal (advisor)
This bachelor thesis aims to create a system for automatic transcription of delivery notes - documents with a fixed structure. The solution is divided into two parts. The first part is table lines detection and subsequent detection and extraction of cells, that contain required data. The second part is handwritten numeric characters recognition in the images of the cutted cells. The resulting system can detect cells with the required data with 100 % accuracy with well-scanned delivery notes, while the success rate of numerical character recognition is more than 95 % for individual characters and more than 92 % for entire character sequences. The benefit of this work is a system for automatic transcription of delivery notes, which provides faster and easier otherwise lengthy rewriting of the contents of delivery notes to the information system in the retail. By using this system, the employee saves more than 50 % of the time on each delivery note.
Automatic tagging of musical compositions using machine learning methods
Semela, René ; Galáž, Zoltán (referee) ; Kiska, Tomáš (advisor)
One of the many challenges of machine learning are systems for automatic tagging of music, the complexity of this issue in particular. These systems can be practically used in the content analysis of music or the sorting of music libraries. This thesis deals with the design, training, testing, and evaluation of artificial neural network architectures for automatic tagging of music. In the beginning, attention is paid to the setting of the theoretical foundation of this field. In the practical part of this thesis, 8 architectures of neural networks are designed (4 fully convolutional and 4 convolutional recurrent). These architectures are then trained using the MagnaTagATune Dataset and mel spectrogram. After training, these architectures are tested and evaluated. The best results are achieved by the four-layer convolutional recurrent neural network (CRNN4) with the ROC-AUC = 0.9046 ± 0.0016. As the next step of the practical part of this thesis, a completely new Last.fm Dataset 2020 is created. This dataset uses Last.fm and Spotify API for data acquisition and contains 100 tags and 122877 tracks. The most successful architectures are then trained, tested, and evaluated on this new dataset. The best results on this dataset are achieved by the six-layer fully convolutional neural network (FCNN6) with the ROC-AUC = 0.8590 ± 0.0011. Finally, a simple application is introduced as a concluding point of this thesis. This application is designed for testing individual neural network architectures on a user-inserted audio file. Overall results of this thesis are similar to other papers on the same topic, but this thesis brings several new findings and innovations. In terms of innovations, a significant reduction in the complexity of individual neural network architectures is achieved while maintaining similar results.
Automatic Delivery Note Transcription
Necpál, Dávid ; Kišš, Martin (referee) ; Hradiš, Michal (advisor)
This bachelor thesis aims to create a system for automatic transcription of delivery notes - documents with a fixed structure. The solution is divided into two parts. The first part is table lines detection and subsequent detection and extraction of cells, that contain required data. The second part is handwritten numeric characters recognition in the images of the cutted cells. The resulting system can detect cells with the required data with 100 % accuracy with well-scanned delivery notes, while the success rate of numerical character recognition is more than 95 % for individual characters and more than 92 % for entire character sequences. The benefit of this work is a system for automatic transcription of delivery notes, which provides faster and easier otherwise lengthy rewriting of the contents of delivery notes to the information system in the retail. By using this system, the employee saves more than 50 % of the time on each delivery note.
Automatic tagging of musical compositions using machine learning methods
Semela, René ; Galáž, Zoltán (referee) ; Kiska, Tomáš (advisor)
One of the many challenges of machine learning are systems for automatic tagging of music, the complexity of this issue in particular. These systems can be practically used in the content analysis of music or the sorting of music libraries. This thesis deals with the design, training, testing, and evaluation of artificial neural network architectures for automatic tagging of music. In the beginning, attention is paid to the setting of the theoretical foundation of this field. In the practical part of this thesis, 8 architectures of neural networks are designed (4 fully convolutional and 4 convolutional recurrent). These architectures are then trained using the MagnaTagATune Dataset and mel spectrogram. After training, these architectures are tested and evaluated. The best results are achieved by the four-layer convolutional recurrent neural network (CRNN4) with the ROC-AUC = 0.9046 ± 0.0016. As the next step of the practical part of this thesis, a completely new Last.fm Dataset 2020 is created. This dataset uses Last.fm and Spotify API for data acquisition and contains 100 tags and 122877 tracks. The most successful architectures are then trained, tested, and evaluated on this new dataset. The best results on this dataset are achieved by the six-layer fully convolutional neural network (FCNN6) with the ROC-AUC = 0.8590 ± 0.0011. Finally, a simple application is introduced as a concluding point of this thesis. This application is designed for testing individual neural network architectures on a user-inserted audio file. Overall results of this thesis are similar to other papers on the same topic, but this thesis brings several new findings and innovations. In terms of innovations, a significant reduction in the complexity of individual neural network architectures is achieved while maintaining similar results.

Interested in being notified about new results for this query?
Subscribe to the RSS feed.